Preprocessing Opportunities in Optimal Numerical Range Partitioning

نویسندگان

  • Tapio Elomaa
  • Juho Rousu
چکیده

We show that only the segment borders have to be taken into account as cut point candidates in searching for the optimal multisplit of a numerical value rangewith respect to convex attribute evaluation functions. Segment borders can be found efficiently in a linear-time preprocessing step. For strictly convex evaluation functions inspecting all segment borders is also necessary. With Training Set Error, which is not strictly convex, the data can be preprocessed into an even smaller number of cut point candidates, called alternations, when striving for optimal partition. Examining all alternations also seems necessary. We test empirically the reduction of the number of cut point candidates that can be obtained for Training Set Error on real-world data. The experiment shows that in some domains significant reduction in the number of cut point candidates can be obtained.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Solving Set Partitioning Problems with Constraint Programming

This paper investigates the potential of constraint programming for solving set partitioning problems occurring in crew scheduling, where constraint programming is restricted to not employ external solvers, as for instance integer linear programming solvers. We evaluate preprocessing steps known from the OR literature on moderately sized set partitioning problems. Further, we propose a new prep...

متن کامل

Refinement Tree Based Partitioning for Adaptive Grids

We present a new partitioning algorithm for grids obtained by adaptive reenement. The method uses the adaptive reenement tree to obtain information unavailable to other partitioning methods which use only the nal grid and/or some geometric data. The algorithm requires (typically) O(log(N)) operations after an O(N) preprocessing step. The method is guaranteed to produce perfectly balanced connec...

متن کامل

A Preprocessing Technique to Investigate the Stability of Multi-Objective Heuristic Ensemble Classifiers

Background and Objectives: According to the random nature of heuristic algorithms, stability analysis of heuristic ensemble classifiers has particular importance. Methods: The novelty of this paper is using a statistical method consists of Plackett-Burman design, and Taguchi for the first time to specify not only important parameters, but also optimal levels for them. Minitab and Design Expert ...

متن کامل

On Optimal End-to-End QoS Budget Partitioning in Network Dimensioning∗

We investigate the problem of optimal end-to-end QoS budget partitioning to quantify the advantage for network dimensioning of having a non-uniform allocation of end-to-end QoS requirement over the links in a path. We extend a previous revenue maximization model to M/G/1 queuing models and propose a fast partitioning heuristic based on the M/M/1 case. We then show on small networks with M/G/1 q...

متن کامل

Partitioning strategies for the Block Cimmino algorithm

In the context of the block Cimmino algorithm, we study preprocessing strategies to obtain block partitionings that can be applied to general linear systems of equations Ax = b. We study strategies that transform the matrix AA into a matrix with a block tridiagonal structure. This provides a partitioning of the linear system for row projection methods because block Cimmino is essentially equiva...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001